Speaker-basis Accent Clustering Using Invariant Structure Analysis and the Speech Accent Archive
نویسندگان
چکیده
English is the only language available for global communication and is used by 1.5 billions of speakers. It is also known to have a large diversity of pronunciation due to the influence of speakers’ mother tongue, called accents. Our project aims at creating a global and speaker-basis map of English accents to be used in learning World Englishes as well as research studies of World Englishes [1, 2]. Creating the map, i.e., speaker-basis accent clustering, mathematically requires a distance matrix in terms of accents among all the speakers considered, and technically requires a method of predicting the accent distance between any pair of the speakers by using their speech samples only. In [3, 4], our first trials were presented, where invariant structure analysis was effectively used for feature extraction. However, some technical problems were found through the experiments and in this paper, recent progresses are presented with additional explanation on the invariant structure, which were omitted in [3, 4] due to space limitations. Use of the invariant structure and Support Vector Regression shows a striking performance of distance prediction in a speaker-pair-open mode but the performance is not sufficient in a speaker-open mode.
منابع مشابه
Automatic prosodic segmentation by F0 clustering using superpositional modeling
In this paper, we propose an automatic method for detecting accent phrase boundaries in Japanese continuous speech by using F0 information. In the training phase, hand labeled accent patterns are parameterized according to a superpositional model proposed by Fujisaki, and assigned to some clusters by a clustering method, in which accent templates are calculated as centroid of each cluster. In t...
متن کاملSpeech recognition for multiple non-native accent groups with speaker-group-dependent acoustic models
In this paper, the recognition performance for non-native English speech with two different kinds of speaker-groupdependent acoustic models is investigated. The approaches for creating speaker groups include knowledge-based grouping of non-native speakers by their first language, and the automatic clustering of speakers. Clustering is based on speakerdependent acoustic models in speaker Eigensp...
متن کاملForeign accent conversion through voice morphing
We present a voice morphing strategy that can be used to generate a continuum of accent transformations between a foreign speaker and a native speaker. The approach performs a cepstral decomposition of speech into spectral slope and spectral detail. Accent conversions are then generated by combining the spectral slope of the foreign speaker with a morph of the spectral detail of the native spea...
متن کاملForeign accent classification using source generator based prosodic features
Source Generator Based Prosodic Features John H.L. Hansen and Levent M. Arslan Robust Speech Processing Laboratory Duke University Department of Electrical Engineering Box 90291, Durham, North Carolina 27708-0291 ABSTRACT Speaker accent is an important issue in the formulation of robust speaker independent recognition systems. Knowledge gained from a reliable accent classi cation approach could...
متن کاملFOREIGN ACCENT CLASSIFICATION USING SOURCE GENERATOR BASED PROSODIC FEATURES - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Speaker accent is an important issue in the formulation of robust speaker independent recognition systems. Knowledge gained from a reliable accent classification approach could improve overall recognition performance. In this paper, a new algorithm is proposed for foreign accent classification of American English. A series of experimental studies are considered which focus on establishing how s...
متن کامل